Development of Kannada Speech Corpus for Continuous Speech Recognition
نویسندگان
چکیده
منابع مشابه
An Amharic speech corpus for large vocabulary continuous speech recognition
• has rich morphology -> many word forms. Phonetics Amharic has a set of 38 phones, seven vowels and thirty-one consonants. Consonants Manner Voicing Place of Articulation of Art/n Lab Dent Pal Vel Glo Stops Voiceless p[p] t[t] m[t∫ ] k[k] [?] Voiced b[b] d[d] ¥[d ] g[g] GlottalizedÍ[p‘] μ[t‘] 1⁄2[t∫ ‘]q[q] Rounded [kw], [gw], [qw] Fricatives Voiceless f[f] s[s] ][∫ ] h[h] Voiced z[z] [ ] Glo...
متن کاملDevelopment of Large Vocabulary Continuous Speech Recognition Using Phonetically Structured Speech Corpus
This paper presents the results of acoustic modeling used in a Large Vocabulary Continuous Speech Recognition (LVCSR) system designed with the use of a phonetically controlled large vocabulary corpus. Evaluation experiments showed that relatively good speech recognition results may be obtained with adequate training material, taking into account: a) the presence of lexical stress; b) speech sty...
متن کاملDesign and recording of Czech speech corpus for audio-visual continuous speech recognition
In this paper we describe the design, recording, and content of a large audio-visual speech database intended for training and testing of audio-visual continuous speech recognition systems. The UWB05-HSCAVC database contains high resolution video and quality audio data suitable for experiments on audio-visual speech recognition. The corpus consists of nearly 40 hours of audiovisual records of 1...
متن کاملImproved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملContinuous Audio-visual Speech Recognition Continuous Audio-visual Speech Recognition
We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audiovisual speech recognition applications. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal model...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Applications
سال: 2018
ISSN: 0975-8887
DOI: 10.5120/ijca2018917255